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<160> 11 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 2954 
<212> PRT 
<213> Xenopus sp. 



<220> 

<223> Xenopus centromere-associated pr,otein-E (XCENP-E) 
member of the kinesin superfamily of microtubule 
motor proteins 

<220> 

<221> DOMAIN 
<222> (1) . . (472) 

<223> kinesin like motor domain 



<220> 

<221> DOMAIN 

<222> (473) . . (2752) 

<223> rod domain 

<220> 

<221> DOMAIN 

<222> (2753) . . (2954) 

<223> tail domain 



Me^Ser Glu Gly Asp Ala Val Lys Val Cys Val Arg Val Arg Pro Leu 
I 5 10 15 



He 



Gin Arg Glu Gin Gly Asp Gin Ala Asn Leu Gin Trp Lys Ala Gly 



20 



25 



Asn Asn Thr lie Ser Gin Val Asp Gly Thr Lys Ser Phe Asn Phe Asp 
35 40 45 

Arg Val Phe Asn Ser His Glu Ser Thr Ser Gin lie Tyr Gin Glu He 

56 



Ala Val Pro lie 
65 

Phe Ala Tyr Gly 



Thr Pro Asn Ser 
100 

Lys He He Gin 
115 



He Arg Ser Ala 
70 

Gin Thr Ser Ser 
85 

Leu Gly He He 



Glu He Pro Asn 
120 



Leu Gin Gly Tyr 
75 

Gly Lys Thr Tyr 
90 

Pro Gin Ala He 
105 

Arg Glu Phe Leu 



Asn Gly Thr He 
80 

Thr Met Met Gly 
95 

Gin Glu Val Phe 
110 

Leu Arg Val Ser 
125 



Tyr Met Glu He Tyr Asn Glu Thr Val Lys Asp Leu Leu Cys Asp Asp 
130 135 140 

Arg Arg Lys Lys Pro Leu Glu lie Arg Glu Asp Phe Asn Arg Asn Val 
145 150 155 160 

Tyr Val Ala Asp Leu Thr Glu Glu Leu Val Met Val Pro Glu His Val 
165 170 175 

He Gin Trp lie Lys Lys Gly Glu Lys Asn Arg His Tyr Gly Glu Thr 

180 , - 185 " 190 

Lys Met Asn Asp His Ser Ser Arg Ser His thr lie Phe Arg Met He 
195 200 205 

i 

Val Glu Ser Arg Asp Arg Asn Asp Pro Thr Asn Ser Glu Asn Cys Asp 
210 215 220 

Gly Ala Val Met Val Ser His Leu Asn Leu Val Asp Leu Ala Gly Ser 
225 230 235 240 

Glu Arg Ala Ser Gin Thr Gly Ala Glu Gly Val Arg Leu Lys Glu Gly 
245 250 255 

Cys Asn He Asn Arg Ser Leu Phe He Leu Gly Gin Val lie Lys Lys 
260 265 270 

Leu Ser Asp Gly Gin Ala Gly Gly Phe He Asn Tyr Arg Asp Ser Lys 
275 280 285 



Leu Thr Arg lie Leu Gin Asn Ser Leu Gly Gly Asn Ala Lys Thr Val 
290 295 300 

He He Cys Thr lie Thr Pro Val Ser Phe Asp Glu Thr Leu Ser Thr 
305 r\ 310 315 320 

A 

Leu Gin Phe Ala/ Ser Thr Ala Lys His Val Arg Asn Thr Pro His Val 
1 ./ 325 330 335 

Asn Glu Val Leu Asp Asp Glu Ala Leu Leu Lys Arg Tyr Arg Lys Glu 
340 345 350 

lie Leu Asp Leu Lys Lys Gin Leu Glu Asn Leu Glu Ser Ser Ser Glu 
355 360 365 

Thr Lys Ala Gin Ala Met Ala Lys Glu Glu His Thr Gin Leu Leu Ala 
370 375 380 



Glu lie Lys Gin Leu His Lys Glu Arg Glu Asp Arg lie Trp His Leu 
385 390 395 400 

Thr Asn lie. Val Val Ala Ser Ser Gin Glu Ser Gin Gin Asp Gin Arg 
405 410 415 

Val Lys Arg Lys Arg Arg Val Thr Trp Ala Pro Gly Lys lie Gin Asn 
420 425 430 

Ser Leu His Ala Ser Gly Val Ser Asp Phe Asp Met Leu Ser Arg Leu 
435 440 445 

Pro Gly Asn Phe Ser Lys Lys Ala Lys Phe Ser Asp Met Pro Ser Phe 
450 455 460 

Pro Glu lie Asp Asp Ser Val Cys Thr Glu Phe Ser Asp Phe Asp Asp 
465 470 475 480 

Ala Leu Ser Met Met Asp Ser Asn Gly lie Asp Ala Glu Trp Asn Leu 
485 490 495 

Ala Ser Lys Val Thr His Arg Glu Lys Thr Ser Leu His Gin Ser Met 

500 ; ' 505 510 

lie Asp Phe Gly Gin lie Ser Asp Ser Val Gin Phe His Asp Ser Ser 
515 520 525 

i 

Lys Glu Asn Gin Leu Gin Tyr Leu Pro Lys Asp Ser Gly Asp Met Ala 
530 535 540 

Glu Cys Arg Lys Ala Ser Phe Glu Lys Glu lie Thr Ser Leu Gin Gin 
545 550 555 560 

Gin Leu Gin Ser Lys Glu Glu Glu Lys Lys Glu Leu Val Gin Ser Phe 
565 570 575 

Glu Leu Lys lie Ala Glu' Leu Glu Glu Gin Leu Ser Val Lys Ala Lys 
580 585 590 

Asn Leu Glu Met Val Thr Asn Ser Arg Glu His Ser lie Asn Ala Glu 
595 600 605 

Val Gin Thr Asp Val Glu Lys Glu Val Val Arg Lys Glu Met Ser Val 
610 615 620 

Leu Gly Asp Ser Gly Tyr Asn Ala Ser Asn Ser Asp Leu Gin Asp Ser 
625 630 635 640 

Ser Val Asp Gly Lys Arg Leu Ser Ser Ser His Asp Glu Cys lie Glu 
645 650 ' 655 

His Arg Lys Met Leu Glu Gin Lys lie Val Asp Leu Glu Glu Phe lie 
660 665 670 

Glu Asn Leu Asn Lys Lys Ser Glu Asn Asp Lys Gin Lys Ser Ser Glu 
675 680 685 

Gin Asp Phe Met Glu Ser He Gin Leu Cys Glu Ala He Met Ala Glu 
690 ' 695 700 



Lvs Ala Asn Ala Leu Glu Glu Leu Ala Leu Met Arg Asp Asn Phe Asp 
7 J 5 710 715 720 

Asn He He Leu Glu Asn Glu Thr Leu Lys Arg Glu He Ala Asp Leu 
725 730 735 

Glu Arg Ser Leu Lys Glu Asn Gin Glu Thr Asn Glu Phe Glu He Leu 
740 745 750 

Glu Lys Glu Thr Gin Lys Glu His Glu Ala Gin Leu He His Glu He 
755 760 765 

Gly Ser Leu Lys Lys Leu Val Glu Asn Ala Glu Met Tyr Asn Gin Asn 
770 775 780 

Leu Glu Glu Asp Leu Glu Thr Lys Thr Lys Leu Leu Lys Glu Gin Glu 
785 790 795" 800 

He Gin Leu Ala Glu Leu Arg Lys. Arg Ala Asp Asn Leu Gin Lys Lys 
805 810 815 

Val Arg Asn Phe Asp Leu Ser Val Ser Met Gly Asp Ser Glu Lys Leu 
820 825 830 

Cys Glu Glu He Phe Gin Leu Lys Gin Ser Leu Ser Asp Ala Glu Ala 
335 840 845 

Val Thr Arg Asp Ala Gin Lys Glu J Cys Ser Phe Leu Arg Ser Glu Asn 
850 855 860 

Leu Glu Leu Lys Glu Lys Met Glu Asp Thr Ser Asn Trp Tyr Asn Gin 
865 870 875 880 

Lvs Glu Lys Ala Ala Ser Leu Phe Glu Lys Gin Leu Glu Thr Glu Lys 
885 890 895 

Ser Asn Tyr Lys Lys Met Glu Ala Asp Leu Gin Lys Glu Leu Gin Ser 
900 905 910 

Ala Phe Asn Glu He Asn Tyr Leu Asn Gly Leu Leu Ala Gly Lys Val 
915 920 925 

Pro Arg Asp Leu Leu Ser Arg Val Glu Leu Glu Lys Lys Val Ser Glu 
930 935 940 

Phe Ser Lys Gin Leu Glu Lys Ala Leu Glu Glu Lys Asn Ala Leu Glu 
945 950 955 960 

Asn Glu Val Thr Cys Leu Ser Glu Tyr Lys Phe Leu Pro Asn Glu Val 
965 970 975 

Glu Cys Leu Lys Asn Gin He Ser Lys Ala Ser Glu Glu He Met Leu 
980 985 990 

Leu Lys Gin Glu Gly Glu His Ser Ala Ser He He Ser Lys Gin Glu 
995 1000 1005 

He He Met Gin Glu Gin Ser Glu Gin lie Leu Gin Leu Thr Asp Glu 
1010 1015 1020 



• t 



Val Thr His Thr Gin Ser Lys Val Gin Gin Thr Gl« Glu Gin Tyr Leu 
1025 1030 1035 

Glu Met Lys Lys Met His Asp Asp Leu Phe Glu Lys Tyr lie Arg Asn 
1045 1° 50 i 

Lys ser Glu Ala Glu Asp Leu Leu Arg Glu Met Glu Asn Leu Lys Gly 
1060 1055 

Thr Met Glu Ser Val Glu Val Lys lie Ala Asp Thr Lys His Glu Leu 

1075 1° 80 1085 

Glu Glu Thr He Arg Asp Lys Glu Gin Leu Leu His Glu Lys Lys Tyr 

1090 1095 
Phe Phe Gin Ala Met Gin Thr lie Phe Pro lie Thr Pro Leu Ser Asp 
1105 Hl° 1115 

Ser Leu Pro Pro Ser Lys Leu Val Glu Gly Asn Ser Gin Asp Pro lie 
1125 iiJU 

Glu lie Asn Asp Tyr His Asn Leu lie Ala Leu Ala Thr Glu Arg Asn 

1140 . - - ; 11413 

Asn lie Met Val Cys Leu Glu Thr Glu Arg Asn Ser Leu Lys Glu Gin 

1155 1160 
Val He Asp Leu Asn Thr Gin Leu ' G ln Ser Leu Gin Ala Gin Ser He 
1170 U 75 1180 

rir, tu<5 Pro Lvs Gin Asp Leu Glu Glu Gly Glu 
Glu Lys Ser Asp Leu Gin Lys Pro Lys tin y 

1185 II 90 1195 

Val Lys Leu Leu Leu Glu Met Glu Leu Leu Lys Gly His Leu Thr Asp 

1205 121U 

Ser Gin Leu Ser Ue Glu Lys Leu Gin Leu Glu Asn «» 01. VI Thr 

1220 iZZ0 
Glu Lys Leu Gin Thr Leu Gin Glu Glu Met Lys Asn lie Thr lie Glu 

• 1235 1240 
Arg Asn Glu Leu Gin Thr Asn Phe Glu Asp Leu Lys Ala Glu His Asp 

1250 1255 
ser Leu Lys Gin Asp Leu Ser Glu Asn lie Glu Gin Ser He Glu Thr 
1265 1270 12 " 

Gin Asp Glu Leu Ar, ,1a Ale Gin Glu Glu Leu Ar g Glu Gin Lys Gin 

1285 1290 
Leu Val Asp Ser Phe Arg Gin Gin Leu Leu Asp Cys Ser Val Gly He 



1300 



1305 



Ser Ser Pro Asn His Asp Ala^al Ala Asn Gin Glu^ys Val Ser Leu 



1315 



1320 



Gly Glu Val Asn Ser Leu Gin Ser Glu Met Leu Arg Gly Glu Arg Asp 



1330 



1335 



m 



Glu Leu Gin Thr Ser Cys Lys Ala Leu Val Ser Glu Leu Glu Leu Leu 
1345 1350 1355 1360 

Arg Ala His Val Lys Ser Val Glu Gly Glu Asn Leu Glu lie Thr Lys 
1365 1370 1375 

Lys Leu Asn Gly Leu Glu Lys Glu lie Leu Gly Lys Ser Glu Glu Ser 
1380 1385 1390 

Glu Val Leu Lys Ser Met Leu Glu Asn Leu Lys Glu Asp Asn Asn Lys 
1395 1400 1405 

Leu Lys Glu Gin Ala Glu Glu Tyr Ser Ser Lys Glu Asn Gin Phe Ser 
1410 1415 1420 

Leu Glu Glu Val Phe Ser Gly Ser Gin Lys Leu Val Asp Glu lie Glu 
1425 1430 1435 1440 

Val Leu Lys Ala Gin Leu Lys Ala Ala Glu Glu Arg Leu Glu lie Lys 
1445 1450 1455 

Asp Arg Asp Tyr Phe Glu Leu Val Gin Thr Ala Asn Thr Asn Leu Val 
1460 , 1465 1470 

Glu Gly Lys Leu Glu Thr Pro Leu Gin Ala Asp His Glu Glu Asp Ser 
1475 1480 1485 

lie Asp Arg Arg Ser Glu Glu Met *Glu lie Lys Val Leu Gly Glu Lys 
1490 1495' 1500 

Leu Glu Arg Asn Gin Tyr Leu Leu Glu Arg Leu Gin Glu Glu Lys Leu 
1505 1510 1515 1520 

Glu Leu Ser Asn Lys Leu Glu He Leu Gin Lys Glu Met Glu Thr Ser 
1525 1530 1535 

Val Leu Leu Lys Asp Asp Leu- Gin Gin Lys Leu Glu Ser Leu Leu Ser 
1540 1545 1550 

Glu Asn He He Leu Lys Glu Asn He Asp Thr Thr Leu Lys His His 
1555 1560 1565 

Ser Asp Thr Gin Ala Gin Leu Gin Lys Thr Gin Gin Glu Leu Gin Leu 
1570 1575 1580 

Ala Lys Asn Leu Ala He Ala Ala Ser Asp Asn Cys Pro lie Thr Gin 
1585 1590 1595 1600 

Glu Lys Glu Thr Ser Ala Asp Cys Val His Pro Leu Glu Glu Lys He 
1605 1610 1615 

Leu Leu Leu Thr Glu Glu Leu His Gin Lys Thr Asn Glu Gin Glu Lys 
1620 1625 1630 

Leu Leu His Glu Lys Asn Glu Leu Glu Gin Ala Gin Val Glu Leu Lys 
1635 1640 1645 



Cys Glu Val Glu His Leu Met Lys Ser Met He Glu Ser Lys Ser Ser 
1650 1655 1660 



Leu 
1665 



01. Ser 'Leu Gin His Gl» Lys His Asp Thr CI. Gin Gin Leu Leu 

1670 15/:> 

T rin rin Met Gin Val Val Thr Gin Glu Lys Lys Glu Leu 
Ala Leu Lys Gin Gin Met « 1695 

1685 1690 
G l„ Gin Thr His Gin His Len Thr »£ Glu Val Asp HI.™ Lys Glu 



1700 



Asn He 



Glu Leu Gly Leu Asn^ Lys Asn Glu Ala^ln Gin Lys Thr 



1715 



Thr Lys Glu Gin cys Leu Leu Asn Glu Asn Lys Glu Leu Glu Gin Ser 



1730 



1735 



rln rv _ GLu He Glu Glu Leu Met Lys Ser Leu Lys 
Gin His Arg Leu Gln Cys Glu i±e ^j.u i7gQ 



1745 



1750 



Asp Lys Glu S« Ala Leu Glu Thr. Leu^ 



Leu Lys Glu Ser Glu Gin Lys Val 



1765 



1775 



rm rui Met Glu Met Val Met Leu Glu Met Glu Glu 
He Asn Leu Asn Gin Glu Met uxu i Q 

1780 ' .• 178:5 

nn'ara Thr Val He Ala Glu Arg Asp Gin Leu Gin 
Leu Lys Asn Ser Gin Arg Thr a i805 



1795 



1800 

Val Glu Met Ser lie Glu Thr Gin A'sp Asp 

1820 



Asp Asp Leu Arg Glu Ser Va 

1810 1815 
Leu Arc Lys Ala Gl^Glu Ma L,u Gin Gln^ln Lys Asp Lys V.l«» 

l^Leu Thr Ser Gin II. Ser Val Leu Sin Glu Lys ile Ser Leu Leu 

1845 18bU 

uai Ala Thr Val Lys Glu Thr Leu Ser 
Glu Asn Gin Met Leu Tyr Asn Val^Ala Thr 1870 



1860 



acn rln Ser lvs Gin His Leu Phe Ser Glu Ile 
Glu Arg Asp Asp Leu Asn Gin Ser Lys ^ 

1875 1880 



Glu Thr 
1890 



Leu Ser Leu Ser Leu Lys Glu Lys Glu^he Ala Leu Glu Gin 



1895 



AXa Glu Lys Asp Lys Ala Asp Ala Ala Arc Lys Thr lie Asp n.* 
1905 1910 " 

slu Lys Lie Ser Asn He G!u Glu Gin L«u Leu Gin Gin Ala^r Asn 

1925 19jU 
Leu Lys Glu Thr Leu Tyr Glu Ar^Glu Ser Leu Xle Gln^ys Lys Glu 



1940 



rin Leu Arq Glu Thr Leu Lys Ser Lys 

Gin Leu Ala Leu Asn Thr Glu His Leu Arg b ^ 

1955 . iybU 

T.n riv Lvs Met Glu Gin Glu Arg Asp Glu Ala Ala Asn 
Asp Leu Ala Leu Gly Lys Met u 

1970 



Lys Val lie Ala Leu Thr Glu Lys Met Ser Ser Leu Glu Glu Gin lie 

1985 1990 1995 2000 

Asn Glu Asn Val Thr Thr Leu Lys Glu Gly Glu Gly Glu Lys Glu Thr 

2005 2010 2015 



Phe Tyr Leu Gin Arg Pro Ser Lys Gin Gin Ser Ser Ser Gin Met Glu 
2020 2025 2030 

Glu Leu Arg Glu Ser Leu Lys Thr Lys Asp Leu Gin Leu Glu Glu Ala 
2035 2040 2045 

Glu Lys Glu lie Ser Glu Ala Thr Asn Glu lie Lys Asn Leu Thr Ala 
2050 2055 2060 

Lys lie Ser Ser Leu Glu Glu Glu lie Leu Gin Asn Ala Ser .lie Leu 
2065 2070 2075 2080 

Asn Glu Ala Val Ser Glu Arg Glu Asn Leu Arg His Ser Lys Gin Gin 
2085 2090 2095 

Leu Val Ser Glu Leu Glu Gin Leu Ser Leu Thr Leu Lys Ser Arg Asp 
2100 2105 2110 

His Ala Phe Ala Gin Ser Lys Arg Glu Lys Asp Glu Ala Val Asn Lys 
2115 2120 2125 

1 

lie Ala Ser Leu Ala Glu Glu lie Lys lie Leu Thr Lys Glu Met Asp 
2130 2135 2140 

Glu Phe Arg Asp Ser Lys Glu Ser Leu Gin Glu Gin Ser Ser His Leu 
2145 2150 • 2155 2160 

Ser Glu Glu Leu Cys Thr Tyr Lys Thr Glu Leu Gin Met Leu Lys Gin 
2165 2170 2175 

Gin Lys Glu Asp lie Asn Asn Lys Leu Ala Glu Lys Val Lys Glu Val 
2180 2185 2190 

Asp Glu Leu Leu Gin His Leu Ser Ser Leu Lys Glu Gin Leu Asp Gin 
2195 2200 2205 

lie Gin Met Glu Leu Arg Asn Glu Lys Leu Arg Asn Tyr Glu Leu Cys 
2210 2215 2220 

Glu Lys Met Asp lie Met Glu Lys Glu lie Ser Val Leu Arg Leu Met 
2225 2230 2235 2240 

Gin Asn Glu Pro Gin Gin Glu Glu Asp Asp Val Ala Glu Arg Met Asp 
2245 2250 2255. 

He Leu Glu Ser Arg Asn Gin Glu He Gin Glu Leu Met Glu Lys He 
2260 2265 2270 

Ser Ala Val Tyr Ser Glu Gin His Thr Leu Leu Ser Ser Leu Ser Ser 
2275 2280 2285 

Glu Leu Gin Lys Glu Thr Glu Ala His Lys His Cys Met Leu Asn He 
2290 2295 2300 



o c ar Thr Leu Ser Arg Ser Phe Gly Ser Leu Gin 
Lys Glu Ser Leu Ser Ser Thr Leu ber a 232Q 



2305 2310 



Thr Glu His val Lys Leu Asn Thr Gin Leu Gin Thr Leu Leu^sn Lys 
2325 

Thr a1 _ A i a val Lys Glu Asp His Ser Leu 
Phe Lys Val Val Tyr Arg Thr Alalia Val by v ^ q 



2340 

t ^ AsD Leu Ala Ala Glu Gin Lys Arg His Asp 
He Lys Asp Tyr Glu Lys Asp Leu ^ 

2355 ^ J0U 

t nn Leu Gin Cys Leu Glu Gin His Gly Arg Lys Trp 
Glu Leu Arg Leu Gin Leu bin ^ 

2370 

3e r Asp Ser Ma S« Olu 01» Leu ly . -Cys ». - - - 

2385 2390 

„ B Glu Leu Leu^e LV. ^ Ue S " Val »» " P 

Rsp P „e Ser Oln Val Phe Leu - % £ ^ ^ 

2420 • 

S ln Glu^lu - OXuBi. Ly*. Gly «» «« ^ ^ 

Phe cJlZ L.a » is ».i »p >*■ - - Glu Gly ^ 

2450 2455 

„„ 61 u ». w « ; s « Thr Leu leu Thr L, %"» 

-a Valval Gin Ser Lys lUCln *«, *" ™ ™ * 

i-i « nn ivs Lvs Glu Gin Asn Lys 
Leu Asn Gin Phe Glu Ala Lys Leu^ln Glu Lys Ly ^ 

- 2500 zou ° 
• eu L eu „e t *r, «, «e< CI. Hi. Hi= Gly Pro Ser M, - -1 « 

2515 Z3<iU 

■ S1B Glu Glu *»„ -a AT, Leu Leu «y « & - ™ 

2530 

01. Ser Ly. Lys Leu Gin Ser «, II. Ly. £t Leu Glu Asn Glu^eu 

2545 2550 

fts „ u, v.! Lys «p G1J G1 " ^ "^"l 

2565 " 

or Rra Asn Ala Glu Ala Glu Leu Asn Ala 
Leu Gin Asp Lys Leu Leu Ser ^Asn * ^ 



„et 01. val Lys Leu Thr Lys Lys Oln Asp Asn Leu^ln Ma Ma ««t 
2595 

T rm Lvs Met Val Ala Lys Gly Ala Val Pro 
Lys Glu He Glu Asn Leu Gin Lys Met ^ 

2610 " 13 



(1/ 



■•■10 • 



Tyr Lys Glu Glu lie Asp Asn Leu Lys Thr Lys Val Val Lys lie Glu 
2625 2630 2635 2640 

Met Glu Lys lie Lys Tyr Ser Lys Ala Thr Asp Gin Glu lie Ala Tyr 
2645 2650 2655 

Leu Lys Ser Cys Leu Glu Asp Lys Glu Glu Gly Leu Arg Arg Leu Lys 
2660 2665 2670 

Glu Glu Leu Arg Arg Ala Gin Ala Asp Asn Asp Thr Thr Val Cys Val 
2675 2680 2685 

Pro Lys Asp Tyr Gin Lys Ala Ser Thr Phe Pro Val Thr Cys Gly Gly 
2690 2695 2700 

Gly Ser Gly lie Val Gin Ser Thr Ala Met Leu Val Leu Gin Ser Glu 
2705 2710 2715 2720 

Lys Ala Ala Leu Glu Arg Glu Leu Ser His Tyr Lys Lys Lys Tyr His 
2725 2730 2735 

His Leu Ser Arg Thr Met Ser Ser Ser Glu Asp Arg Lys Lys Thr Lys 
2740 . - , 2745 2750 

Ala Lys Ser Asp Ala His Ser Ser His Thr Gly Ser Ser His Arg Gly 
2755 2760 2765 

Ser Pro His Lys Thr Glu Thr Tyr Arg His Gly Pro Val Thr Fro Glu 
2770 2775 2780 

Arg Ser Glu Met Pro Ser Leu His Leu Gly Ser Pro Lys Lys Ser Glu 
2785 2790 2795 2800 

Ser Ser Thr Lys Arg Val Val Ser Pro Asn Arg Ser Glu lie Tyr Ser 
2805 2810 2815 

Gin Leu Val Met Ser Pro Gly Lys Thr Gly Met His Lys His lie Leu 
2820 2825 2830 

Ser Pro Ser Lys Val Gly Leu His Lys Lys Arg Ala Leu Ser Pro Asn 
2835 2840 2845 

Arg Ser Glu Met Pro Thr Gin His Val lie Ser Pro Gly Lys Thr Gly 
2850 .2855 2860 

Leu His Lys Asn Leu Thr Glu Ser Thr Leu Phe Asp Asn Leu Ser Ser 
2865 2870 2875 2880 

Pro Cys Lys Gin Gin Lys Val Gin Glu Asn Leu Asn Ser Pro Lys Gly 
2885 2890 2895 

Lys Leu Phe Asp Val Lys Ser Lys Ser Met Pro Tyr Cys Pro Ser Gin 
2900 2905 2910 

Phe Phe Asp Asn Ser Lys Leu Gly Asp Phe Ser Glu Leu Asn Thr Ala 
2915 2920 2925 



Glu Ser Asn Asp Lys Ser Gin Ala Glu Asn Trp Trp Tyr Glu Ala Lys 
2930 2935 2940 



(el 




Lys Glu Thr Ala Pro Glu Cys Lys Thr Ser 
2945 2950 



<210> 2 
<211> 9626" 
<212> DNA 
<213> Xenopus sp. 

<220> 

<223> XCENP-E nucleotide sequence 

<220> 
<221> CDS 

<222> (143) . . (9007) 
<400> 2 

gaattccgga gtcggatagg ctagtcgggc gagggaaatt caaactggtt atagaagaaa 60 
cttgaaccgc cgccaaaagg gactaaagt.g acagagacag ggagcggtgt cggtaccgat 120 
ttctccacta atcggtctca aaatgtccga 'gggagatgca gttaaagtgt gtgtgagggt 180 
tcggccgctt atacagagag aacaagggga tcaagccaac ctgcaatgga aggctggaaa 240 
caacaccatt tcccaagttg atgggacaaa gtctttcaat ttcgatcgtg tatttaattc 300 
tcacgaatca acaagtcaaa-t-ttaccaaga aatagcagta cctatcatac gatcagcttt 360 
gcagggatat aatggcacaa tatttgcata cggacagaca tcttcaggca agacgtacac 420 
aatgatggga acaccaaatt cattgggcat aataccccaa gccatacagg aagtttttaa 480 
aattattcag gagataccga acagagagtt tcttctaaga gtttcttata tggagattta 540 
caatgaaact gtgaaagacc tactgtgtga^ tgacagaaga aagaagccct tggaaattcg 600 
cgaggatttt aatagaaacg tgtatgttgc tgacctgact gaagaacttg ta'atggttcc 660 
tgaacatgta atacagtgga tcaaaaaggg tgaaaaaaac agacattatg gagagactaa 720 
aatgaatgat catagtagtc gttcacatac aatatttaga atgattgttg aaagccgaga 780 
cagaaatgat cccacaaatt cagagaactg tgatggagct gtcatggtat ctcacttgaa 840 
tttggtagat cttgctggca gtgaaagagc aagccaaact ggagctgaag gtgtgagact 900 
taaggaaggc tgcaacatca accgcagctt gtttatcctt ggacaggtta ttaagaagct 960 
tagcgacggc caggctggtg gatttataaa ctacagagac agcaaactca ccagaattct 1020. 
ccaaaattca ttgggaggaa atgctaaaac ggttataatt tgcacaatta cgccagtttc 1080 
ttttgatgag actctaagta cacttcagtt tgccagtact gccaaacatg tgagaaatac 1140 
tccccatgtt aatgaggtcc tggatgatga agcgttgcta aaaaggtaca gaaaggaaat 1200 
cttggattta aagaaacaat tagagaattt agagtcatcg tctgaaacaa aagctcaagc 1260 
aatggctaaa gaagagcata cacagttgct agctgaaatc aaacaactac acaaagagag 1320 
agaagataga atatggcact tgacaaatat tgttgttgct tcatcccaag aatctcaaca 1380 
ggaccaaagg gtcaaacgaa aacgaagagt tacgtgggcg ccaggaaaaa tccaaaatag 1440 
tttacatgct tctggtgttt ctgactttga tatgctatcc agattacctg gcaattttag 1500 
caagaaggca aagttctctg acatgccttc atttccagaa attgatgact ctgtttgtac 1560 
agagttttct gattttgatg acgccctctc catgatggac agcaatggaa tagatgcaga 1620 
atggaattta gccagtaaag taactcacag agaaaagaca tcacttcatc aatcaatgat 1680 
agactttgga cagatttctg acagtgttca gtttcatgat tcttctaagg aaaaccagct 1740 
acaatacctc cccaaagact ctggtgatat ggctgaatgc agaaaagctt cttttgaaaa 1800 
agagatcaca agcctccagc aacaactaca gtcaaaggag gaagaaaaaa aggaacttgt 1860 
acaaagcttc gagctcaaga tagcagaact ggaagagcag cttagtgtca aagctaaaaa 1920 
tctagagatg gttacaaact cgagagagca ttccattaat gctgaagtcc aaacagatgt 1980 
tgaaaaggaa gttgtgagaa aagaaatgtc agtccttgga gactctggtt acaatgcatc 2040 
aaacagtgac ctacaggata gttctgttga tggtaaacgt ctaagcagct cccatgatga 2100 
gtgtatagaa cacagaaaaa tgctggaaca aaagatcgtt gatttagaag agtttattga 2160 
aaaccttaac aagaaaagtg agaatgataa acaaaaatct tctgagcaag attttatgga 2220 
gagtattcag ctatgtgaag ctataatggc agaaaaggca aatgcactgg aggaactggc 2280 
acttatgcga gataattttg acaatattat tctagagaat gaaactctaa aaagggaaat 2340 
tgcagatctg gaacgttcac tcaaggagaa tcaagaaacc aatgagtttg aaattctgga 2400 
gaaggaaact caaaaagaac acgaggcaca actaatccat gagattggca gtttaaagaa 24 60 
attagttgaa aatgcagaga tgtacaatca aaatcttgag gaagatctag aaactaaaac 2520 
aaaacttctg aaagagcagg aaattcaact tgcagaatta aggaaacgcg cagataactt 2580 
gcagaaaaaa gtacgaaatt ttgatctctc ggtttccatg ggtgatagtg agaaactctg 2640 



tgaagaaatc tttcaactga agcaatctct ttctgatgct gaagctgtga ctcgcgatgc 2700 
tcagaaggaa tgttctttcc tcagaagtga aaatctagag ctgaaggaga aaatggagga 2760 
cacatcaaac tggtacaatc aaaaagaaaa ggctgcgtct ttgtttgaga agcagctgga 2820 
aactgaaaaa tcaaactaca agaaaatgga agctgatttg cagaaagagt tgcaaagtgc 2880 
ttttaatgag attaactact taaatggcct tctggcagga aaggtcccca gagatttgct 2940 
ttctcgtgtt gaattagaga aaaaggtttc tgagttctca aagcagcttg agaaagcatt 3000 
ggaagaaaaa aatgccttgg agaatgaagt gacttgccta tcagaataca aatttttgcc 3060 
aaatgaagtt gaatgcttga aaaatcagat cagcaaggct tctgaagaga taatgttatt 3120 
aaagcaagaa ggagaacatt ctgcatctat tataagcaaa caagagatta tcatgcagga 3180 
gcaatctgag cagattttac aactgactga cgaagtgaca cacacacagt caaaagtgca 3-240. 
gcagactgaa gagcaatact tggagatgaa gaaaatgcat gatgatcttt ttgaaaagta 3300 
tatcagaaac aaaagtgaag ctgaagacct tttaagagaa atggagaacc ttaaaggcac 3360 
tatggagtct gtggaagtaa agattgctga cacaaaacat gaacttgaag aaactataag 3420 
ggataaggag caactgcttc atgagaaaaa atactttttt caagcaatgc agactatatt 3480 
tccgattaca cctctttcag actcgcttcc tccctcaaaa ttagttgaag ggaactctca 3540 
agaccccata gaaatcaatg actaccacaa tttaatagcc cttgctacag aaaggaacaa 3600 
cattatggtg tgtctagaga ctgaaagaaa cagtctcaag gagcaagtta ttgatttgaa 3660 
cactcaactt caaagtcttc aagcacaaag" tattgaaaag tctgatctcc agaagccaaa 3720 
gcaagacttg gaagaaggag aggttaaact gcttttggag atggaactac tgaagggaca 3780 
cctaactgac tcacagctgt ctatagagaa attgcagctg gagaatctgg aagttacaga 3840 
aaaactccaa acacttcaag aagagatgaa aaatattact atagaaagga atgagcttca 3900 
gaccaacttc gaagacctga aagcagagca tgatagccta aaacaagacc ttagtgaaaa 3960 
cattgagcag tcaattgaaa^qacaagatga attaagggct gcccaggaag agctaagaga 4020 
acagaagcaa ctggttgata gctttagaca acagctttta gattgttctg taggaatttc 4080 
atcaccaaac catgatgcag ttgctaacca ggaaaaggtg tcattgggtg aagttaattc 4140 
gttacaaagt gaaatgctgc gtggtgaaag agatgagctt caaacatctt gtaaggcatt 4200 
agtttcagaa ctggagctac ttcgtgctca tgtaaaatct gtggagggag aaaaccttga 4260 
aatcacaaaa aaactcaatg gccttgaaaa ggagatattg ggcaaatctg aagaaagcga 4320 
agtgttgaag tccatgttgg agaatctaaa ggaagacaac aataagctca aagaacaagc 4380 
agaggaatat tctagtaaag aaaatcaatt cagtttagaa gaggtgttca gtggttcaca 4440 
gaagctggta gatgagatag aggtcctgaa agcacagcta aaggcagcag aagaaaggct 4500 
ggaaataaag gatagagatt attttgaact tgtacaaact gcaaacacca atttagttga 4560 
gggaaaattg gaaactccat tgcaagctga ccatgaggaa gacagcattg atcggcgttc 4 620 
tgaagaaatg gagataaaag ttcttggaga aaaacttgag cgaaatcagt atttactgga 4680. 
aagattgcaa gaagaaaagc tggaactgtc taacaaactt gaaatccttc agaaagagat 4740 
ggagacgtcc gttctattaa aagatgacct gcaacagaag ctagaaagct tgctgagtga 4800 
aaacattatt ctaaaagaga atattgacac aaccctaaag catcattcag atactcaagc 4860- 
tcagctgcag aaaacacagc aagagctaca gttggctaag aatcttgcaa tcgctgcttc 4920 
tgacaattgt ccaataactc aagaaaagga aacctctgca gattgtgtgc atcctctgga 4980 
agaaaagata ttgttattaa ctgaagaatt gcatcaaaaa actaatgaac aggaaaaatt 5040 
actacatgaa aagaatgaac ttgaacaagc tcaggttgag ctaaagtgcg aggtggaaca 5100 
tctgatgaag agtatgatcg aatcgaagtc ctcacttgag tccttgcagc atgagaaaca 5160 
tgatactgaa caacaacttc ttgctcttaa acagcagatg caagtagtta ctcaagaaaa 5220 
gaaagagctg caacaaaccc atgaacactt aacagctgag gtggaccatc taaaagagaa 5280 
tattgaattg ggtttgaatt ttaaaaatga agcgcagcaa aagaccacta aagagcaatg 5340 
tctgctaaat gagaataaag aacttgagca aagccagcac agacttcaat gtgagataga 5400 
agagcttatg aaaagcttaa aggataaaga gtcagcgctg gaaactttaa aagaatctga 54 60 
acaaaaagta attaatctga accaagaaat ggaaatggtt atgctggaaa tggaggaatt 5520 
gaaaaatagc cagaggactg taattgctga gagggaccag ctgcaagacg acctaaggga 5580 
gagtgttgaa atgtccattg aaactcaaga tgatctaaga aaggctcaag aagcattgca 5640 
gcagcagaaa gataaagttc aggaactgac ctcccagatt tctgtcctgc aggaaaagat 5700 
ctctcttctg gaaaatcaga tgctttataa tgttgcaact gtgaaagaaa ctctaagcga 5760 
aagagatgac ctgaaccagt ctaagcaaca cctgttctca gaaattgaaa ctcttagtct 5820 
gtctttaaaa gaaaaggaat ttgcattgga acaagcagag aaggacaaag ctgatgctgc 5880 
caggaaaaca atagatatca cagagaagat atcaaatata gaagaacagt tacttcaaca 5940 
agccaccaat ttaaaggaaa ctctttatga aagagagagt cttatccagt gtaaggagca 6000 
actggctttg aacacagaac accttaggga aacattgaag agcaaagact tggcattggg 6060 
taaaatggag caggagagag atgaagctgc caataaagta atagctctta cagaaaagat 6120 
gtcttctcta gaagaacaga tcaatgaaaa tgttactact ctaaaagaag gtgagggtga 6180 
aaaagagacc ttctaccttc agagaccttc aaagcagcag tcttcttccc agatggaaga 6240 
gctcagagag tctttaaaga ccaaagattt gcagttggaa gaggccgaga aggagataag 6300 




tgaagctact aatgaaataa agaatctcac tgctaagatc tcttctctag aggaggagat 6360 
tcttcagaat gctagcattt tgaatgaagc tgtaagcgaa agggaaaacc ttcgccattc 6420 
gaagcagcaa ctggtttcag aattggagca gctatcactg acattaaaga gtagagacca 6480 
tgcatttgcg caatctaaac gcgaaaagga tgaagctgta aataaaatag ccagtctcgc 6540 
tgaagaaata aagatcctga caaaagagat ggatgaattc agagattcaa aggaatcctt 6600 
gcaagaacag tcttcccatc taagtgaaga gttatgtaca tataagactg aacttcaaat 6660 
gctcaagcaa cagaaagaag acatcaacaa caaacttgca gagaaagtta aggaagtgga 6720 
tgagctattg caacacttat catctctaaa ggaacagctg gaccaaatac agatggagct 6780 
aaggaatgaa aagctcagaa actatgaact ctgcgaaaag atggatatca tggaaaaaga 6840 
aatctcagtg ctgcgtttaa tgcagaacga gcctcagcag gaagaagatg atgttgcaga 6900 
acgtatggat atacttgaga gcagaaacca agaaatacag gagctgatgg aaaaaatctc 6960 
cgctgtgtat tcagagcaac acactttgct cagcagtctc tctagtgagc ttcaaaagga 7020 
aactgaagca cacaaacatt gcatgttaaa tataaaggaa tctctgtcat ccacgctctc 7080 
cagatccttt ggcagcttgc aaactgagca tgttaagcta aatactcaac tgcagaccct 7140 
tctgaacaaa tttaaggttg tataccgaac tgctgcagtc aaagaagatc atagcttgat 7200 
caaagattat gagaaggacc ttgctgctga gcaaaagagg catgatgagc tgcgactcca 7260 
actgcagtgt ttggagcagc acggcagaaa atggtcggat tctgcatctg aggaactcaa 7320 
gttctgtgaa attgaattct tgaatgagtt- actttttaaa aaagcaaata taattcagag 7380 
tgtccaggat gacttttcag aggtgcaggt att-cctaaat caagtaggat caacactgca 7440 
agaagagctt gagcacaaga aaggctttat -gcagtggttg gaggaatttg gagatctgca 7500 
cgtcgatgct aagaaactca gtgaaggcat gcaacaggaa aataggcgca ttgcttctac 7560 
catacagctc ttaacaaaaa ggctaaaggc agttgttcag tcgaaaatac aacgtgagat 7620 
aaccgtatat ctgaaccagt -ttgaagcgaa attgcaagag aagaaagaac aaaacaaaga 7680 
acttatgcgc agaatggagc accacggccc tagtgctagt gtaatggagg aagaaaatgc 7740 
tagactttta ggcatactga aaactgttca agatgaatcc aagaaactcc aatcaaggat 7800 
caaaatgcta gaaaatgaac tgaacttggt caaagatgat gccatgcaca aaggtgaaaa 7860 
agttgcaatt ttgcaagaca aactactaag cagaaacgcc gaagctgagc taaacgcaat 7920 
gcaggtgaaa ctaactaaaa agcaagata^ tcttcaggct gcaatgaaag aaatagaaaa 7980 
cctacagaaa atggttgcca aaggtgcagt accatataaa gaagaaattg acaaccttaa 8040 
aactaaggtg gtaaagattg aaatggaaaa aataaagtac tcaaaagcaa cagaccaaga 8100 
gattgcctac ttaaagtctt gtttggaaga taaggaagaa ggcttgcgta ggttaaaaga 8160 
ggaacttagg cgagcacagg cagacaacga tacaacagtt tgtgttccaa aagattatca 8220 
gaaagcttca actttccctg tgacttgtgg tggtggaagc ggtatagtgc agagcacagc 8280 
aatgcttgtg ctgcagtcgg aaaaagccgc cttggaaagg gagctgtcac attataagaa 8340 
gaaatatcat catttatcac ggactatgtc aagttctgaa gatcgaaaga aaacaaaggc 8400 
aaaatctgat gctcattctt ctcatactgg atcatcacac agaggctcgc ctcacaaaac 84 60 
tgaaacttac agacatggcc ctgttactcc agaaaggtct gaaatgccaa gcctacacct 8520. 
aggatctcca aagaagtcgg agtccagcac taaacgtgtt gtgtcaccaa acaggtccga 8580 
aatctacagc caattagtaa tgtctccagg caagaccggg atgcataaac atatactttc 8 64 0 
tccaagcaag gttggactgc acaaaaagcg tgccctgtct ccaaacagat cggagatgcc 8700 
cacccagcat gtcatatccc ctggcaagac cggactgcat aaaaatctaa ctgaaagcac 8760 
gttattcgac aatttgtctt ctccatgcaa acagcaaaaa gtacaggaaa atctaaattc 8820 
ccctaaaggc aaattatttg atgtgaagtc aaaatcgatg ccttactgtc catctcagtt 8880 
ttttgataat tctaagcttg gtgatttttc agagctcaac acagcagaga gcaatgacaa 8940 
aagtcaggct gagaactggt ggtatgaagc aaaaaaagaa acggcacctg aatgtaaaac 9000 
atcctagatc cctgtacatc tgactctcct gtcctgcaaa gagacttgct actctgcctt 9060 
cttgtaggaa gaaacactag aaactgccat gtctgcataa aggagtctca ctggaagcaa 9120 
aagttgttct ttagtagtaa tcactggttg ggcgagtggt tacgtcttta aaataaagtg 9180 
caatacgtct cacgtctatt ttatatgtta tgtctgtgta tttgttacac ttttaagtcc 9240 
cttgacttca tatttggctc atctgtagtt tctttgtgtt tgcgcacaca cacactggtg 9300 
aaaaatgaca tttgcagtgt attgttttac tgactggtct ctctggggcc atccatgcaa 9360 
agcaccatta gtgtgccaat gtttttcact acttattatt atgtctgact ttgtgaaata 9420 
gaaaatacta caaaagatag ggcaaaagtt gttttctcac taggtaaacg atatgggttt 9480 
aacttaatta ttctcctaat aaaatattct atataatggc atctcattag gtgacctaat 9540 
ggttattctg cactcttgta aacctttttt aaatttcact cgtaataaag cagccctgat 9600 
tttaaattaa aaaaaaaacg gaattc 9626 
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<210> 3 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 9 amino acid 
insertion encoded by one cDNA isolated' 

<400> 3 

Asn Ser Arg Glu His Ser lie Asn Ala 

r 5 



<210> 4 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : putative 
nuclear localization signal (NLS) 

<400> 4 

Arg Lys Lys Thr Lys 
1 5 



<210> 5 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 5 

gggctgccca ggaagag 

<210> 6 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 6 

gacagcattg atcggcg 



<210> 7 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
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<400> 7 

gagggttcgg ccgctta 



17 



<210> 8 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 8 

tctggggcca tccatgc 17 



<210> 9 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 



<210> 10 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 10 

gtcaggtcag caacatacac g 



<210> 11 
<211> 30 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : sequence linked 
at C-terminus of amino acids 1-473 of XCENP-E in 
plasmid pET23dXCEMycHis 

<220> 

<221> PEPTIDE 

<222> (12) . . (20) 

<223> c-myc epitope tag 

<400> 11 

Thr Val Ser lie Ser Leu Gly Asp Leu Thr Met Glu Gin Lys Leu lie 
1 5 10 15 

Ser Glu Glu Asp Leu Asn Phe Glu His His His His His His 



<400> 9 

catatgacca tggccgaggg agatgcag 



28 



20 



25 



30 




